A-PRIORI BOUNDS FOR THE 1-D CUBIC NLS IN 
NEGATIVE SOBOLEV SPACES 



HERBERT KOCH AND DANIEL TATARU 

Abstract. We consider the cubic Nonlinear Schrodinger Equation (NLS) 
in one space dimension, either focusing or defocusing. We prove that the 
solutions satisfy a-priori local in time H s bounds in terms of the H s size of 
the initial data for s > — i. 

— 6 



1. Introduction 

The one dimensional cubic Nonlinear Schrodinger equation (NLS) 

(1) iu t — u xx ± u\u\ 2 = 0, w(0) = Wo- 

arises as generic asymptotic equation for modulated wave trains. Its has a 
particularly rich structure: It is Hamiltonian with respect to the symplectic 
structure 

and the Hamiltonian 

-(u'f ±-\u\ A dx. 

There are infinitely many conserved quantities. The NLS equation is com- 
pletely integrable in the sense that there exist Lax pairs for it. The machinery 
of inverse scattering allows to construct many interesting solutions, among 
them solitary waves in the focusing case. 

The NLS is globally well-posed for initial data u$ G L 2 , and locally in time 
the solution has a uniform lipschitz dependence on the initial data in balls. 

On the other hand (pQ) is invariant with respect to the scaling 

u(x,t) — > \u(\x, \ 2 t) 

This implies that the scale invariant initial data space for ([1]) is H~z. Thus 
one is motivated to ask whether the local well-posedness also holds in negative 
Sobolev spaces. 

The equation (CQ) is also invariant under the Galilean transformation 

u(x,t) ^ e icx - lcH u(x + 2ct,t) 
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which corresponds to a shift in the frequency space. As a consequence there is 
no uniformly continuous dependence on the initial data (see [BJ, [5]). This is 
not unexpected; if local uniformly continuous dependence were to hold in any 
negative Sobolev space, by Gallilean invariance and scaling this would imply 
global in time local in space uniformly continuous dependence on the initial 
data in L 2 . 

What we expect below L 2 is for the cubic NLS to exhibit genuinely nonlinear 
dynamics, which corresponds to a continuous but not uniformly continuous 
dependence on the initial data. One may be tempted to think that local well- 
posedness should hold all the way down to s — — ~. However, such a result is 
far out of reach for now and we would not even speculate whether it is true or 
not. 

On the other hand, there is another very natural threshold, which is con- 
nected to the main motivation of the present paper. In a recent paper Kappeler 
and Topalov [5] proved that the mKdV equation 

V t ~ Vxxx + v x v 2 = 0, v o = v o 

on the torus is well-posed for initial data in L 2 . The proof relies on complete 
integrability of the equation, and it uses the machinary of integrable equations 
in a fundamental way. One may ask whether the same result holds on the real 
line, and also whether it is possible to find arguments which do not use the 
integrable structure. 

To connect this problem with the NLS equation we consider modulated 
wave train solutions v of the form v = §lw where w is frequency localized in a 
neighborhood of size h of some large frequency A. Then w solves the equation 

w t +i\ 3 w+3i\ 2 {D x -\)w+3i\{D x -\) 2 w+3i\w\w\ 2 ^ 0{h 3 )w+0{h)w\w\ 2 +w 3 

For li <C A we neglect the first two terms on the right. The w 3 term is non- 
resonant and is also neglected. Then the substitution 

w(t, x) = \-h~ ixSt e tXx u(t, A-3 (x - 3\ 2 t)) 
turns the above equation into ([1]) with modified constants. 

3 

A frequency range of size A for u turns into a frequency range of size \i = A^ 
for w. By construction this frequency range for w is centered at the origin, 
but we can use a Galilean transformation to shift it to a dyadic region. We 
can also easily compute 

\\u(o)\\v = a* Ho) |Ua = ^lh>(o)IU» 

Hence the mKdV equation with initial data in L 2 is similaiQ to the NLS equa- 
tion with initial data in H~&. We view the one dimensional NLS equation as 

*We emphasize that this similarity applies only for solutions in a dyadic frequency range. 
On the other hand in our analysis later in the paper we see that some of the most difficult 
to control multilinear interactions occur in the case of unbalanced frequencies, where this 
analogy no longer applies. 
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a simpler model in the analysis of the KdV equation; this is due to the added 
Gallilean invariance. However, it is also interesting in its own right. 

The threshold s = — | also arises in several key steps of our analysis later 
on, having to do with the interaction of high and low frequencies. We are led 
to 

Conjecture 1. The cubic NLS equation (TjQ) is locally well-posed for initial 
data in H s with s > — \. 

To prove this one would need to establish a-priori H s bounds for the solu- 
tions and then prove continuous dependence on the initial data. In this article 
we solve the easier half of this problem. 

Theorem 1. Let s > — |. For any M > there exists T > and C > so 
that for any initial data u G L 2 satisfying 

\\uo\\h s < M 

there exists a solution u G C(0,T;L 2 ) to ([T]) which satisfies 

\\u\\l°°h° < C||mo||^ 

While writing this paper the authors have learned that similar results were 
independently obtained by Christ-Colliander-Tao [2j. Their results apply in 
the range s > — ^. 

We also refer the reader to the work of Vargas- Vega [TDJ and Griinrock p[] 
who consider the cubic NLS in alternative function spaces below L 2 , but only 
in settings where the local Lipschitz dependence on the initial data still holds. 

Remark 1.1. In the process of proving the theorem we actually obtain a better 
characterization of the solution u, namely we show that u bounded in a space 
X s defined in the next section which embeds into L°°H S and has the property 
that the nonlinear expression \u\ 2 u is well defined for u G X s with a bound 
depending only on the H s norm of the initial data. 

We note that by rescaling the problem reduces to the case of small initial 
data. Then we take M = e, small and T = 1, C = 2. 

We begin with a dyadic frequency decomposition of the solution u, 

u = ^u x 

x 

To measure the H s norm of u we use the stronger norm than L°°(H S ), 

\\u\\f2 LooHB = VsupA 2s ||M A (t)||2 2 
x 1 

That we can use this instead of the L°°H S norm is a reflection of the fact that 
there is not much energy transfer between different dyadic frequencies. 

To prove the theorem we need two Banach spaces X s and Y s , defined in the 
next section, in order to measure the regularity of the solution u, respectively 
of the nonlinear term \u\ 2 u. 
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The linear part of the argument is given by 
Proposition 1.2. The following estimate holds: 

\\ u \\x° < |M|pz,°°h» + \\iut - Am|| fs 



To estimate the nonlinearity we need a cubic bound, 
Proposition 1.3. Let — | < s < and u £ X s . JTien |m| 2 u £ F s and 

IIM^IIy* < IMIx« 

Finally we need to propagate the if s norm: 
Proposition 1.4. Lei — | < s < 0, and u be a solution to (pQ) wift 

llwllpiooifs 1. 

T/ien we /jai>e 

The plan of the paper is as follows. In the next section we motivate and intro- 
duce the spaces X s and Y s , as well as establish the linear mapping properties 
in Proposition 11.21 In Section [3] we discuss the linear and bilinear Strichartz 
estimates for solutions to the linear equation. 

The trilinear estimate in Proposition 11.31 is proved in Section HI Finally 
in the last section we use a variation of the I-method to construct a quasi- 
conserved energy functional and compute its behavior along the flow, thus 
proving Proposition 11.41 

To conclude this section we show that the conclusion of the Theorem follows 
from the above Propositions. We first note that if uq £ L 2 then by iteratively 
solving the equation on small time intervals we obtain a solution u up to time 
1, which satisfies 

(2) iu t — An £ L 2 

This easily implies that u £ l 2 L°°H s , and also that u £ X s . 

To prove the theorem we use a continuity argument. Let e > be a small 
constant and suppose that ||mo||h s (r) < e - Fi x a small threshold 5, e 5 <^ 1 
and denote by A the set 

A = {T e [0, 1]; |H|pi/x> ij«([o,T]xR) < 25, ||w||x<»([o,t]xir) < 2(5} 

We claim that A = [0, 1]. To show this we first observe that £ A. The 
norms above increase with T, therefore A is an interval. We show that A is 
both open and closed in [0, 1]. 
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By ([2]) it easily follows that the norms in the definition of A are continuous @ 
with respect to T. This implies that A is closed. 
Finally let T G A. By Proposition 11.41 we obtain 

IM|z 2 L°°/f" s ([0,T]xR) ^ £ + £ 3 - 

Then by Propositions 11.21 11.31 we obtain 

IM|xs([0,T]xR) < e + <5 3 

If e and S are chosen to be sufficiently small we conclude that 

IMIz 2 L°°H s ([0,T]xR) < S, |MU s ([0,T]xR) < 8 

By the continuity of the norms with respect to T it follows that a neighborhood 
of T is in A. 

Hence A = [0, 1] and the Theorem [1] is proved. 

2. The function spaces 

To understand what to expect in terms of the regularity of u we begin with 
some heuristic considerations. If the initial data u to (CQ) satisfies ||wo||l 2 < 1 
then the equation can be solved iteratively using the Strichartz estimates. We 
obtain essentially linear dynamics, and the solution u belongs to the space 
X ' 1 associated to the Schrodinger equation (see the definition in ([3]) below). 

Let s < 0. Consider now the same problem but with initial data Uq G H s , 
localized at frequency A. Then the initial data satisfies ||uo||z, 2 ^ A~ s . By 
rescaling we conclude that the evolution is still described by linear dynamics 
up to the time A 4s . 

Then it is natural to consider a dyadic decomposition of the solution u 

A 

and to measure the U\ component uniformly in X 4s time intervals. We remark 
that this is reasonable for as long as there is not much input coming from the 
higher frequencies. This is the technical point where the s = — | threshold 
arises in our proof. 

A good candidate for measuring u\ in A 4s time intervals is given by Bour- 
gain's X s,b spaces defined by 

(3) ll*..» = / Hr,0\ 2 e(l + \r-e\) 2b d^dr 

where the natural choice for b from a scaling standpoint is b = |. However, 
this choice leads to logarithmic divergences in estimates, so one commonly 
uses instead some b > | but close to it. We could do this here but it would 
complicate the bookkeeping and would also not work at s = — |. For b — \ 
one can go one step further and consider dyadic decompositions with respect 



This of course depends on the definition of the X s norm, but it is straightforward to 
prove. 
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to the modulation r — £ 2 . This leads to the additional homogeneous Besov 
type norms 

N^l,. = sjp ^ |«(r,0re 2 1r-e 2 M^ry 

Instead in this paper we use the closely related spaces U\ and V\. Spaces of 
this type have been first introduced in unpublished work of the second author 
on wave- maps, but in the meantime they have been also used in [TJ, p], [8]. 
They turn out to be useful replacements of X s ' b spaces in limiting cases, and 
they retain the scaling of the corresponding space of homogeneous solutions 
to the linear equation. We define them and summarize their key properties in 
what follows. 

Definition 2.1. Let 1 < p < oo. Then U^ is an atomic space, where atoms 
are piecewise solutions to the linear equation, 

U = J2 Mt k ,t k+1 )e itD *u k , ll Mfc lli 2 = 1 

k k 

and {tk} is an arbitrary increasing sequence. 
Clearly we have 

Ul C L°°L 2 

In addition, the functions are continuous except at countably many points, 
and right continuous everywhere. 

A close relative is the space V£ of functions with bounded p-variation along 
the flow: 

Definition 2.2. Let 1 < p < oo. Then is the space of right continuous 
functions u G L°°(L 2 ) for which the following norm is finite, 

\W\\ P v v = IMIW + sup Y^\\e^ D *u{t k ) - ^ + i°l u (t k+l )\\ L * 

where the supremum is taken with respect to all increasing sequences {tk}- 

Conjugation with the Schrodinger group reduces a large part of the study 
of the spaces V p and U p to the scalar case, where we replace the group by the 
identity. 

We have the series of inclusions 

(4) Ul C VI C Ul C L°°L 2 , p < q. 

The inclusion U p C V p can easily checked on atoms. The imbedding V p C U q 
is a little harder and its proof can be found in Section 5 of [7J . 
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We denote by DU A the space of functions 

DU P A = m - d 2 x )u; ueUl) 
with the induced norm. Then we have the trivial bound 

(5) Nk^lKo)||L2 + ||(^-^HI^ 

Finally, we have the duality relation 

(6) (DUD* = V*. 
To see this one first verifies the inequality 

((id t - d 2 x )f,g) L 2dx\ < \\f\\up\\g\\ V P> 



/< 



by checking it for atoms /. Secondly, given L G (DU A )* we apply it to 
characteristic functions of intervals, which allows to define a function g with 



/ 



((id t -d 2 x )f,g) L *dt = L((id t -d 2 x )f). 



An application to suitable atoms shows that g e V A . 
Moreover we have the embedding 

X ^ 1 C U 2 A . 

To see this it suffices to consider a function the Fourier transform of which 
is supported in a fixed dyadic annulus. The statement follows now easily. 
Combined with duality one sees that 

(7) X '^ 1 cU 2 A cVlc 

The U A and V A spaces behave well with respect to sharp time truncations. 
Precisely, if J is a time interval and xi is its characteristic function then we 
have the multiplicative mapping properties 

(8) Xi ■■ Ul -> U P A , xi ■■ VI -> V p 

with uniform bounds with respect to /. 

We use a spatial Littlewood-Paley decomposition 



1= Yl u= Yl p ^ u 



A>1 dyadic A>1 dyadic A>1 dyadic 

as well as a Littlewood-Paley decomposition with respect to the modulation 

r - e, 

A>1 dyadic 

Both decompositions are inhomogeneous. It is easy to verify that we have the 
uniform boundedness properties 

(9) Px : Ul -> U P A , Q X :U P A ^ U P A 
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and similarly for V%. 

For functions at frequency A we introduce a minor variation of the U\, 
respectively spaces, which we denote by U 2 , respectively V 2 . Their norms 
are defined as 

= ||Q< A 2M A ||^2 + ^ \\Xi(t)xj(x)Q>\aUx\\ua, 
l/^A-a.lJ^A" 1 

respectively 

IK||y2 = ||<5<A2Ma||^2 + ^ \\Xl(t)Xj(x)Q> X 2U X \\v2, 

|/|=A-2,|J|=A- 1 

Here the time truncation is still sharp, as above. The spatial truncation may 
be taken sharp or smooth, the two norms are equivalent due to the frequency 
localization. In the last norm we use the simpler space U 2 (where we replace 
A in Ua by zero) instead of this is also immaterial, the U 2 and U\ norms 
are equivalent at frequency A and modulation > A 2 . 

In doing this the U\ norm is slightly weakened, but only in the elliptic 
region: 

IKHt/2 < \\u\\\ui 

To see this it suffices to consider U\ atoms. Steps tj.+i — tk of size larger than 
A~ 2 are essentially canceled by the modulation localization operator Q>\2, 
therefore is suffices to restrict ourselves to the A -2 time scale. But on this 
scale the Schrodinger flow at frequency A is trivial, i.e. there is no propagation. 
Thus one obtains the square summability with respect to the A~ x spatial scale. 

Since we preserve the duality relation (jfJJ) the V£ norm is slightly strength- 
ened: 

ll M A||y A 2 > HwaIIvi 

The only advantage in using the modified spaces is that they allow us to 
replace a logarithm of the high frequency by a logarithm of the low frequency 
in ( I17|) . which is needed in order for our proofs to work in the limiting case 

We note that the inclusions in (j3J as well as the properties (jSJ), dSD, 0, @ 
and remain valid in the dyadic setting for the modified spaces. 

Now we are ready to introduce the function spaces for the solutions u. We 

set 

(10) \\ U \\ 2 XS =X^\ 2s S up ||x/Ma||^2 

where we sum over all dyadic integers > 1 with the obvious modification at 
A = l. 

To measure the regularity of the nonlinear term we need 

(11) ll/ll^ = E A2s su p WxihWlui 

A \I\=^ 

Due to ([5]) we easily obtain the bound in Proposition 11.21 

8 



3. Linear and bilinear estimates 

We begin with solutions to the homogeneous equation, 

(12) iv t -Av = 0, u(0) = v 

These satisfy the Strichartz estimates: 

Proposition 3.1. Let p,q be indices satisfying 

2 11 

13 - + - = -, 4<p<oo 

p q 2 

Then the solution u to (fl2"l) satisfies 

\\ v \\l*lI < IkolU 2 

In particular we note the pairs of indices (oo, 2), (6, 6) and (4, oo). On occa- 
sion it is convenient to interchange the role of the space and time coordinates. 
Then by interpolating the local smoothing estimate for solutions to (fT2"j) . 

IMU-L 2 ^ a~ 1/2 iKo||l2 

and the maximal function estimate 

IM|l4 L? o < A 1/4 ||w a ,o||l2 

we obtain 

Proposition 3.2. Letp,q be indices satisfying (flBl . Then for every solution 
v to (fi2l) which is localized at frequency A we have 

3 — 1 

IMIlSl? ^ 2 ||v A ,olU 2 
As a straightforward consequence we have 
Corollary 3.3. a) Letp,q be indices satisfying ( TT3|) . Then 

\Ml?l% < Nlu* 

and tne same holds with replaced by V\. 

b) In addition, if v is is localized at frequency A then we have 

WAlui ~ \\ v \\u p A i 

and the same holds with replaced by if p > 2. 

c) For v localized at frequency A the and norms in (a), (b) can be 
replaced by U\ and V£. 

The proof is straightforward, since it suffices to do it for atoms. In the case 
of V£ we also take advantage of the inclusion C p > 2. The estimate 
for V"l and \J\ follows from the embeddings U\ C V£ C V£ for functions at 
frequency A. 

By duality we also obtain 
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Corollary 3.4. a) Letp,q be indices satisfying f fT3|) . Then 



Mdu* £ \M rv'ri 



b) In addition, if v is is localized at frequency A then we have 



3 1 



\v\\du1^^ p 2 IMI l p' l9 ', forp>2. 



c) For v localized at frequency A the DU\ norm in (a), (b) can be replaced by 
DUl 

The second type of estimates we use are bilinear: 

Proposition 3.5. Let A > 0. Assume that u,v are solutions to (1121) which 
are A separated in frequency. Then 

( 14 ) HHU 2 ~ a^||m ||l2|KIIl2 

Proof. In the Fourier space we have 
Then 



which gives 



where £i and £2 are the solutions to 
We have 

drde = 2iei-6i^ide 2 

therefore we obtain 

The conclusion follows. □ 

As a consequence we obtain 

Corollary 3.6. a) Let u,v be functions which are A separated in frequency. 
Then 

(!5) IIHU 2 ^ A ~^IMIt/£lMlt/| 

Let \ <^ fi. Then 

( 16 ) IK^II^ ~ ^"'IM^IM^ 
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Again it suffices to prove these estimates for atoms, and then for solutions to 
the homogeneous Schroder equation. But this follows from the L 4 Strichartz 
estimates and the bilinear estimate of Proposition 13.51 

At a single point in the paper we need a version of ( fl6l) with U 2 replaced 
by V 2 . This is the only place where we use the V 2 modification of V\. 

Proposition 3.7. Let X <ti fx and \I\ = 1. Then 

(17) ||x/m a ^|| L 2 < yT* lnAHuAll^lMI^ 

We note that in order to treat the limiting case s — — | it is acceptable to 
loose In A, but not In /i. 

Proof. We split w A into a low modulation part and a high modulation part, 

u\ = Q<\zu\ + Q>\2U X 

The first term is estimated in simply by counting dyadic regions with 
respect to modulation. The time truncation regularizes the modulation less 
than 1, so we are left with about log A regions. 

On the other hand for the second term we use the I 2 summability with 
respect to rectangles of size A~ 2 x A -1 . Precisely, via Bernstein's inequality we 
have 

||Q>A 2 «a||pl°° < -^II m a||v a 2 

It remains to show that 

where R is a rectangle as above. By the definition of U 2 the problem reduces 
to the case when solves the homogeneous Schrodinger equation. But in that 
case the above inequality is nothing but the classical local smoothing estimate. 

□ 

4. The cubic nonlinearity 
In this section we prove Proposition 11.31 

For a dyadic frequency A we estimate the nonlinearity \u\ 2 u at frequency A 
in a A 4s time interval /. We take a dyadic decomposition of each of the factors 
and denote the corresponding frequencies by Ai, A 2 , A 3 . We consider several 
cases: 

Case 1. Ai 2,3 < A. Then the X s bounds at the Xj frequencies are localized 
to time intervals at least as large as /. Hence we use directly the L 6 Strichartz 
estimates to obtain 

A s ||x/Uai«A2«a 3 ||iw| ^ A 3s ||x/m Ai m A2 m A3 || L 2 

< (AiA 2 A 3 )~ s A 3s ||w Al \\u\ 2 \\x* \\ux 3 \\x* 

The summation with respect to the A/s is straightforward. 
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Case 2. max{Ai, A2, A3} = fi ^> A. In order to have any output at frequency 
A we must have at least two A^'s of size /i. Hence we can assume that 

{Ai, A 2 , A 3 } = {a, /j,, /j,} a<n 

We consider two possibilities: 

Case 2a. a < A <C fi. We begin with the bound 

(!8) 1 1 -Pa(X[o,i]*>Ai v\ 2 vx-MduI < ^ *°g A|K IKHt/f 2 II ^a 3 1 1 ul z 
By duality this is equivalent to 



</i 1 log A||-wai lit/? \\vx 2 \\u'( hxAul IMIvf 



which follows from the bilinear L 2 estimate for the factors u a u^ and X\o,i] u \ u ^- 
The frequency fi functions are only controlled on /i 4s time intervals. Hence 
we need to use f lT8l) on each such time interval and then sum up the output 
from about A 4s /i _4s such intervals. For each interval | J\ = A~ 4s we obtain 

\ s \\Px(xjUx 1 ux 2 ux 3 )\\du 2 x ^ a~>~ 2s A 5 >" 4 > _1 log A||w Al \\ x » \\u X2 \\x» ||«a 3 IU s 

Then we sum this up with respect to a and /1. This imposes the restriction 
s > — I but only due to very large values of \i. We note that we gain almost 
1 + 2s derivatives in this computation. 

Case 2b. a ^> A. For later use we summarize the result in this case in the 
following 

Lemma 4.1. Let I be an interval of length X 4s . Set 

f = P\Xi >J U Xl Ux 2 Ux 3 



Ai,A2,A33>A 



Then we have the estimates 



IIQ>A 2 /|l x o,-^ < A- 1 - 3 !^ 

respectively 

Remark 4.2. TTie same estimates remain true, and in fact become easier, if 
we replace by ux 2 ■ 

We notice that due to the embedding (J7]) and to Corollary 13.41 Lemma 14.11 
implies that 



A s ll flU r2 < A- 1 - 2 *! 



u 



13 

I X s 



which is a gain similar to the one in Case 2(a). Then the proof of Proposi- 
tion [L3] is concluded. 
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Proof of Lemma 4TT To understand the main feature of this case we denote 
by (rj,£j) the frequencies for each factor and by (r, £) the frequency of the 
output. Then we must have 

6 + 6 = 6 + 6, ri + r 3 = r 2 + r 

This yields 

(ri - - (r 2 - 62) + (r 3 - $ - (r - £ 2 ) = 266 - 266 

Since the size of the frequencies {6 6; 6; 6} is {A, ct, //} with A <C a < /j 
we conclude that 

\n ~ 61I + l r 2 - £2 1 + l r 3 - 63I + |r - 6 2 | > a/i, if A 2 = 
respectively 

|ri - 6 2 | + \T2 - 62I + \r 3 - 6 2 | + k " > /A if A 2 = a 

This shows that at least one modulation has to be large, namely at least a fx. 
To take advantage of this we split each factor into a low and a high modulation 
component. There are several cases to consider: 

Case I. This is when we have three small modulations. Then the output 
has large modulation. Depending on whether the conjugated factor has lower 
frequency or not we divide this case in three: 

Case 1(a) Here we consider the first component of /, namely 

fx = ^2 ^ = Yl P A(Q«^(X/M M )Q«^(X/M/,)<5« QM (X/M a )) 

Then is localized at modulation ol\x. We begin with an L 2 bound for the 
triple product Pxiv^v^Va). We claim that 

( 19 ) \\P\{v^V^V a )\\ L 2 < A'^ll^llc^ll^llujIIVallt^ 

Indeed, using the energy bound for and the bilinear L 2 bound for v^v a we 
obtain 

Applying P\ the estimate (fT9l) follows from Bernstein's inequality. 

To use ffT9l) in order to bound fx we decompose each factor w M , respectively 
U\ with respect to time intervals of length /x 4s , respectively a As and apply ffT9l) 
for each combination. The contributions of /z 4s separated intervals is negligible 
since the kernel of Q<^ a ^ decays rapidly on the (a/i)" 1 timescale. Hence there 
are about A 4s /i~ 4s contributions to add up. We obtain 



WfHW < A 4 >- 4s «->- 2s A^^||^|| xs || Mm || xs || Mq || xs 
Since f± a has modulation an this gives 

\\fr\\ x o,-^ < A^ +4a ^" 1 " 6 *«"'" a ||^IU.||^l|x-||w«||x. 

13 



The summation with respect to the dyadic indices a and /J is straightforward 
provided that s > — |. We obtain 

Case 1(b) The second component of / is 

h = Y = Y P A(<5« At 2(X/M M )Q« At 2(X/M Q )Q« M 2(x/M At )) 

Then is localized at modulation /i 2 . We can still use (1191) since the location 
of the complex conjugates does not matter. Hence satisfies the same L 2 
bound as However, because of the larger modulation we obtain a better 
X ' - ^' 1 bound, namely 

After summation with respect to a and fi we obtain the same bound for f 2 as 
for fi, the difference is that the summation can be carried out for s > — ^. 
Case 1(c) The third component of / is 

h = Y fa = Y P A(Q« M 2(x/M M )Q <<M 2(x/M M )Q« /t 2(x/M At )) 

Then is localized at modulation /i 2 . We claim that flTTJT) still holds. To 
prove this we first observe that in order for the output to be at low frequency 
A, two of the frequencies £1,— £2^3 must be fi separated. Then we use the 
bilinear L 2 bound for those two factors, and the energy bound for the third. 
By ( 1191) we obtain as in Case 1(a) 

II/3 ll x o,-*,i < ^ +4S V~^ 7S \\uJxAWJx4uJxs 

and the summation with respect to /x can be carried out for s > — A. 

Case II. This is when at least one factor has large modulation. Depending 
on which factor has large modulation and on whether the conjugated factor 
has lower frequency or not we divide this case in six: 

Case 11(a). Here we consider 

fa = Y ft* = Y P ^XlQ>af,(XlU^)u^U a ) + PxiXlU^Q^iXlU^Ua) 

The two terms are similar, so we restrict our attention to the first one. Our 
starting point is the bound 

(20) llQ^VV'alU 1 ~ ""^"NKvll^ll^ll^H^llf/a 

which is obtained from the L 2 estimate for the first factor and a bilinear L 2 
estimate for the remaining product. 

Low modulation output: By Bernstein's inequality (1201) implies 

(21) IIQ^vvMi^ < A'a'V^IIMi^llM^IWIi/s 
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Summing up fl2TT) over A 4s /i 4s time intervals of length /i 4s we obtain 

H/flU 1 ^ < a~ s ^X^^X^a-^- 1 \\u^\\x4 u A\x'\\ u a.\\x- 
For s > — | we can sum this up with respect to a and A to obtain 

Intermediate modulation output: Consider now the X '"^- 1 estimate 
at modulations A 2 < cr < a/x. From (120]) and Bernstein's inequality we obtain 

(22) ||Q ct Pa(Q^V^")IU 2 ~ (Ao r )5a _ ^ _1 ||^||c/2||^||c/2|ka||c/2 

The kernel of Q a is rapidly decaying off diagonal on the a -1 scale. Then in 
estimating the sum over /x 4s intervals there is a gain coming from the fact that 
we only need square summability with respect to intervals of size cr -1 . We 
consider two cases. 

a) If cr -1 < /i 4s then we need square summability with respect to intervals 
of size /i 4s so we obtain 

HQ^/r"!!^ ^ a f --^-^A 2 >-^(Ao-)5 CK -5^- 1 l|w M ||^.||^|| je .|| tA „||^ 

or equivalently 

\\Q*f7\\ x o,-^ < A5 + 2- a -'-V 1_4 'II^IU.||WA t ||x.||«a|k. 

Adding up with respect to a yields 

\\QJn\ x o,-hi < A3 +Jta a-3-V 1 ~ 4 'ln^||« M |U.||%IU.||««|k« 

<r=max{ A 2 4s } 

and now the summation with respect to a and /x is straightforward for s > — \. 

b) If a -1 > /x 4s then we need square summability with respect to intervals 
of size cr -1 so we obtain 

WQvfnWv ~ a-v^A^a^^^^Aa)^-^- 1 !!^!!^!!^!!^!!^!!^ 

or equivalently 

IIQ.ATII^,-^ < ^^A^^^-v^ll^n^ll^ll^ll^ll^ 

Adding up with respect to a gives 

WQ^T\\ x o,-li ^X-^a^-'fi-^WuJxsWuJxAMlxs 

<X=A 2 

In this case the summation with respect to a, /x gains —2 + 4s derivatives, 
which is better result than needed, but the summation requires s > — |. 

High modulation output: Here we estimate the output at modulations 
o 3> a\i. In order to obtain such an output at least one of the factors must 
have modulation at least a. Without any restriction in generality we assume 
that this is the first factor, as the other cases are considerably simpler. This 
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gives a trivial improvement in the L 2 bound for the first factor, so instead of 
(J22D we have 

(23) \\QcPx{Q> a ^v^%v a )\\ L 2 < (Xa^a'^/i'^Wv^Wu^Wv^Wu^WvaWug 
Then the bound in case (a) above is replaced by 

WQvf^Wtf < a- s /i- 2s X 2s /i- 2 \Xa)^a^ii-^\u^\x4 u ^\xA\ u a\\x^ 
or equivalently 

WQofZtWxe-tA £ ^ +2s a- s ^ s a^\\uJ X s\\uJ X s\\u a \\ Xs 

which has better summability with respect to large a. 

Case 11(b). This is when the low frequency factor has high modulation. 
We consider terms of the form 

fs = ft* = S P ^(XlU^Uj:Q>an(XlUa)) 

Depending on the relative size of a and \i we divide the problem into two 
sub- cases: 

Case II(b)-l. X\i < a 2 . By orthogonality we can assume that the two 
factors are frequency localized in a separated intervals of length a. Then we 
use the bilinear L 2 bound for their product and the L 2 bound for the high 
modulation factor to obtain a weaker analogue of (120]) . namely 

(24) \\P\{v^Q> a ^v a )\\ L i < a~V~*IMI^IWItfjill v «lli/i! 

Low modulation output. Compensating for the weaker bound (1241) . in 
this case there is an improvement in the summation over time intervals. We 
decompose the X 4s time interval I in two steps. First we split it into X is a~ is 
time intervals of length a As , which gives a X As a~ As factor in the summation. 
Secondly we split each a is time interval into a 4s /i _4s time intervals of length 
/i 4s . Since the frequency a factor is square summable with respect to this par- 
tition, by Cauchy-Schwartz this gives only an a 2s /j,~ 2s factor in the summation. 
We obtain 

II /"Ik 1 £ a-'fi-^X^a-^fi-^a^fi-^Wu^WxsWuJxAMx' 
which by Bernstein's inequality implies that 

This is summable with respect to large \i only if s > — ~. However, the restric- 
tion A/i < a 2 improves the summation. Assuming s < — | the \i summation 
yields 

E||n f&PW <T \l+8s -2-llsli ||2 |i II 

Hv<a 2 / 5 Wl^l 2 ^ X x a \M\x*\\ u a\\x° 

o</i<A _1 « 2 

which is summable with respect to a for s > — jr. 
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Intermediate modulation output. A 2 < a < ct/j,. Here we argue as in 
Case 11(a) but using ([21]) instead of (12"U1) . From (T21]) and Bernstein's inequality 
we obtain 

(25) \\QaP\{v^%Q> a ^v a )\\ L 2 < (Ao03a~V~ 5 IIM^IIMt>2lMlu2 

We split this again depending on a but also taking into account the improved 
summability up to the a 4s time scale, as discussed above for the case of low 
modulation output. 

a) If a -1 < /x 4s then due to the square integrability of u a in each X Aa time 
interval we have an interval summation factor A 2s ct~ 2s . Hence 

which yields 

\\QafT\\ x o,-^ < ^a- x -^^-^\K\\xA\u^\xA\u a \\x' 

The summation with respect to a, a and \i is straightforward for s > — ■?. 

b) The case A 2 < a < /i~ 4s is somewhat worse because the kernel of Q a 
decays only on the a -1 scale which is now larger than /i 4s . Hence inputs from 
/i 4s time intervals within each a -1 time interval are no longer orthogonal. This 
yields a weaker interval summation factor, namely \ 2s a~ 2s a~2 fi 2s . Hence 

IIQai-TIU 2 S a->- 2s A 2s a- 2 V-5 M - 2s (Aa)^o;"V"^l^l|x S ||^IU=lka|U» 
which yields 

The summation with respect to o is straightforward: 

(T=A 2 

However, in the a and \x summation we need to use the restriction AyU < a 2 
exactly as in the case of low modulation output. 

High modulation output: Here we estimate the output at modulations 
a ^> afi. Then we can assume that the last factor has modulation at least a 
therefore it satisfies a better L 2 bound, which leads to 

(26) \\QaP\{v^%Q> a ^v a )\\ L 2 < X^a^Wv^WuiWv^WuiWvaWui 
Then instead of the estimate in case (a) above we obtain 

WQaf^Wtf < a- s ^~ 2s \ 2s a~ 2s \^a-^\u^\\ x 4u^\\ X s\\u a \\x^ 
or equivalently 

||<3<7./5 a l x o,-i,i < ^ +2s d'^ s ^ 2s a-^\\u^\\xs\\u^\\xA\u a \\x^ 
and hence 

WQ^f^Wx^ <A- 1 / 2+2 *a- 1 - 3 V"'" 4 'II^IU.||^||x.||u«IU.. 
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The condition A/x < a 2 is again needed. 

Case II(b)-2: A/x > a 2 . Then the arguments in the previous case fail to 
provide enough decay in order to insure summability for very large /x. 

Low modulation output. In this case we are able to establish the follow- 
ing improvement of 

(27) \\Q<tfP\(v^Q> a ^v a )\\ L i L 2 < ^"IivII^IMIl^IMIc/* 

The rest of the analysis is similar to the computation in Case II(b)-l. The 
only difference is that here we gain an extra factor of a(A/x)~2 < 1 5 which 
improves the summation for large /x. 

To prove (f2"TI) we only use the L 2 bound for v a . Then, using the atomic 
decomposition for each of the two factors, we conclude that it suffices to 
prove (j27j) in the case when both factors solve the linear equation. By 
orthogonality we can assume that both are frequency localized in a intervals 
which are a separated. Then we use the L 2 bound for the product of u^u^, 

However, due to the frequency localization we also obtain that the product 
is Fourier localized in a thin rectangle R of size a 2 //x x a/x at slope /x -1 . Next 
we consider the product 

which we view as a product of two L 2 functions with different Fourier local- 
izations. The product is only estimated in a Fourier rectangle of size A x A 2 , 
therefore by orthogonality it suffices to estimate the product assuming that 
both factors are Fourier localized in rectangles Ri, R2 of similar size. The in- 
tersection R = RnRi is a shorter rectangle of size a 2 / /x x A 2 . Our assumption 
a 2 < A/x insures that Rq is essentially vertical. But by Bernstein's inequality 
we have the pointwise bound 

lblU 2 L°° ^ a/^IMU 2 , supp5fC.R0 

therefore (|27p follows. 

Intermediate modulation output, A 2 < a < a/x. Then a similar argu- 
ment applies. Ro has size a 2 / fi x a, which yields the pointwise bound 

IblU- ^ atfr*(Ti\\g\\&, supp g C R 
This in turn leads to 

(28) \\QaP\(v^Q> a ^v a )\\ L 2 < o-^al/x^ll^ll^ll^ll^ll^H^ 

which is again an improvement of a(A/i)~^ over the similar computation in 
Case II(b)-l. 

Large modulation output, o ^> a/x. Then we can assume that the third 
factor has modulation at least a. Also R has size a 2 //x x a/x, therefore 

IMk°° ~ a^lMU 2 , supp g C R 
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which implies that 



(29) \\QaP\iy^v^Q> a v a )\\ L 2 < a sall^J^II^II^II'^ll^, 

an improvement of at least a(A/i)~5 over (|2"6T) . The conclusion follows in a 
similar fashion. 

Case 11(c). This is when the low frequency factor is conjugated but does 
not have high modulation. We consider terms of the form 

h = Y /tT = Y P\(XiQ>r>(Xi u f*)uZ u ii) 
a<s«<^ A<ca</^ 

If a <C (jl then the last two factors are /i separated in frequency. But even if 
a ~ fi, in order for the final output to be at frequency A the two last factors 
must still be fi separated. Then we can use the trilinear bound 

(30) WPxiQ^xuVftV^v^WiA < a^V^lM^IMMMIt/a, 

obtained by estimating in L 2 the first factor and the remaining product. 

The constants here are better than the ones in Case 11(a), and the rest of 
the argument proceeds as there without any significant changes. 

Case 11(d). This is when the low frequency factor is conjugated and has 
high modulation. We consider terms of the form 

h = Y ^ = Y P ^(Xiu l ,Q> tJ ,2(xiu Q )u^ 

In order for the final output to be at frequency A the two frequency fi factors 
must still be fi separated. This leads to the trilinear bound 

( 31 ) \\Px(v^Q> al j,VaV^)\\Li < fi~* \\v^ \\ V 2 \\v a \\ V 2 , 

and the argument is completed again as in Case 11(a) but with better constants. 

Case 11(e). This is when all frequencies are equal and the conjugated factor 
has high modulation. We consider terms of the form 

h = Y ^ = Y p *(Xi u hQ>i* , (Xi u p) u p) 

In some sense this is the worst case because we cannot enforce any frequency 
separation between the two unconjugated factors. We still want to gain some 
power of fi in order to have summability for large fi. At least to some extent we 
can do this by the lateral Strichartz estimates in Corollary I3.3( bc) to obtain 

5 

( 32 ) IIVS^vvlLf rl ^/^IMe^fJi^IMI^ 

i<x L t 

This is done for instance by using the L^Lf bound for one factor, respec- 
tively the L\Lf for the other t> M factor. 
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Low modulation output: After summation with respect to fi 4s time in- 
tervals (1521) gives 

WQotPxiunQfrpUpUn) || 4 < A 4 >- 4 V 3 >" f ll^ll^ 

which is easily summed up with respect to /x for s > — Jj. 

Intermediate modulation output: A 2 < o~ < /i 2 . ^From fl32l) combined 
with Bernstein's inequality we obtain 

(33) WQaPkiv^Q^V^V^Wtf < cr5A3/i"t \\vju2 ||^||c/2||^||[/2 

Adding this up with respect to fi 4s time intervals yields 

||Q^(v?>M 2t vv)IU 2 ~ ^ +4s ^/^~ 7s IKII|s 

or equivalent ly 

||Q CT PAKQ>^M M )|| x o,-i,i < ^ +4 >-2-7*|| M j3^ 

which is easily summed up with respect to a and \i for s > — J^. 

Finally, the output modulations which are larger than /i 2 are treated as in 
the first case. 

High modulation output: a ^> fi 2 . Without any restriction in generality 
we assume that the second factor has modulation at least a. Instead of (1331) 
we get 

(34) \\QaP\{v^Q> a v^v^)\\ L 2 < X*/j,~* ||u M ||o2 \\ v v\\v> htxWui 
Adding this up with respect to \ 4s fi~ time intervals yields 

WQaPxiu^QZaU^Wtf < \^ +4s fX-^~ 7s \\ U J 3 X s 

or equivalently 

The summation with respect to o and /i requires again s > — ^. 

□ 

5. The energy conservation 
It remains to study the conservation of the H s energy. We first set 

E {u) = (A{D)u,u) 
For the straight H s energy conservation it suffices to take 

However in order to gain the uniformity in t required by (fTUj) we need to allow 
a slightly larger class of symbols. 
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Definition 5.1. Let s G 1 and e > 0. Then is the class of spherically 
symmetric symbols with the following properties: 

(i) symbol regularity, 

\d a a{t)\<a{t){l + er a/2 

(ii) decay at infinity, 

Here e is a small parameter. 

We compute the derivative of E along the flow, 

j t Eo(u) = R 4 (u) = 2$l(iA(D)u, \u\ 2 u) 
We write R 4 as a multilinear operator in the Fourier space, 



R 4 (u) = 23? / ia(&)fi(£i)«(&)«(&)«(&)<fo- 
J Pi 

where 

^ = {^1 + 6-6-^4 = 0} 

This can be symmetrized, 

R 4 (u) = ~K [ i(a(£i) + a(6) - «(6) - a(6))w(6)«(6)w(6)«(k)<fo- 
^ </p 4 

Following a variation of the /-method, see Tao [9]-3.9 and references therein, 
we seek to cancel this term by perturbing the energy, namely by 

E 1 (u)= [ 6 4 (6, 6, 6, U)HZi)u(&)u(&u(U)do- 

J P A 

To determine the best choice for B we compute 

jEM= I Mti, 6, 6, 6)(£i 2 + £ 2 2 - si - 6 2 )^(6)^(6)^(6)^(6)^ 

+ J R 6 (w) 
where Rq(u) is given by 

■/6+6-f3-f4=o 

To achieve the cancellation of the quadrilinear form we define 64 by 
(35) 

h (t c c n a(6) + 0(6) - - a(6) , t tf t ncP 

°4VS1) ?2) S3, S4j — 72 ~" f 2 12 72 ' Isl) S2, S3, S4j C -r 4 

SI + S2 S3 S4 

Summing up the result of our computation, we obtain 

(36) j t (E (u) + E^u)) = R 6 (u) 
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In order to estimate the size of E-y{u) and of i? 6 we need to understand the size 
and regularity of b. A-priori b is only defined on the diagonal P4. However, in 
order to separate variables easier it is convenient to extend it off diagonal in 
a smooth way. 

Proposition 5.2. Assume that a £ S f s with s + e < 0. Then for each dyadic 
A < a < /i there is an extension of 64 from the diagonal set 

{((1,6,(3,(4)^4, |(i| ~ A, |( 3 |~ a, |6|,|6l~/^} 
to the full dyadic set 

{ 161 ~ A, |6l wa, 161 
which satisfies the size and regularity conditions 

(37) |9f ^ 2 (6, 6, 6, 6) I < a(A)«-V" 1 A-' 3l a-' 3 >- ft -' 34 

i/ere i/ie implicit constants are independent of \,a,fi but may depend on the 
(3 j 's. 

Proof. We first note that on P 4 we have the factorization 

6 2 + ( 2 2 -6 2 -£ 4 2 = 2(6-6)(6-6) 

along with all versions of it due to the symmetries of P 4 . We consider several 
cases: 

(a) A C a < /1. Then the extension of 64 is defined using the formula 

. ( t , , t\ - a (6)+«(6)-«(6)-«(6) 
M6, 6, 6, 6) - 2(6-6X6-6) 

and its size and regularity properties are straightforward since |6 — 61 ~ a 
and |6 — 61 ~ t 1 - 

(b) A ~ a <C fi. Then the extension of 64 is defined using the formula 

hip t p t\ q(6) - q(6) q(6) - q(6) 
^1,^2,^3,^ - m _ mi _ u) m _ _ u) 

Now only |6 — 61 ~ t 1 is an elliptic factor, while the remaining quotients 
exhibit suitable cancellation properties. 

(c) A « a « /i. Then the extension of 64 is defined by 

, ( , , , c , a(6) + a(6)-a(6 + 6-6)-a(6) 

04Ui>6,6,6J ^77 t\u T~\ 

46 - 6X6 - 6J 

To see that this is a smooth function on the appropriate scale we write it in 
the form 

, ( t , , , n 1 /a(6)-a(6) a(6) -a(6 + 6 -6)~ 

°4(,6,6,6,6J 



2(6-6) V 6-6 6-6 

g(6, 6) - g(6 + (6 - 6), 6 + (6 - 6)) 

2(6-6) 
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where q is the smooth function 

q(0 - q(v) 



□ 



The contribution of E 1 to the energy is easy to control, 
Proposition 5.3. Assume that a £ S% with — |<s — e<s + e<0. Then 
(38) ^(tOI <E (u) 2 

We note that the threshold s = — \ in the proposition is consistent with the 
scaling. 

Proof. We organize the four frequencies £1,62, £3 and £4 in dyadic regions of 
size \ < a < fi = fi. The pointwise bound on b is all we need for the proof 
since in the Fourier space one sees that only the size of the Fourier transform 
matters. For a function u we define u by u = We obtain 

\El(u)\ < ^ |£l(wA,W a ,W M ,M M )| 
A<a<^i=/i 

< ^ a(A)crV -1 ||uA«aVVl|i,i 

A<a</i=/i 

< ^ a (^) a_ V _1 ||%IUHI^a|UH|TVlli 2 
A<a</i=/i 

< ^ ^a{X)a~^fi' 1 \\ux\\L^\\u a \\ L 4^Ah 

X<a<fj,=ix 

where at the last step we have used Cauchy-Schwartz with respect to all param- 
eters. Since s—e > — | it follows that the function Aa(A) increases polynomially 
with respect to A therefore the last sum is finite. 

□ 

The more difficult result we need to prove is 

Proposition 5.4. Assume that a G S% with s + e < and s > — |. Then we 
have 

■I 



(39) 



R 6 (u)dxdt 







< ||«||5r. 



We note that the restriction on the symbol a above is very mild. This is 
because, as one can see in the proof below, there is always a low frequency 
gain in the estimates. The main condition s > — | arises in the summation 

with respect to high frequency factors. 
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Proof. We consider a full dyadic decomposition and express the above integral 
in the Fourier space as a sum of terms of the form 
-i 



/ = / / fc4(6,6 ) 6,^o)%x(6)«A 2 (6)wA 3 (6)-PAo(wA 4 (^4)wA 5 (6)«A 6 (6))^^ 
JO JP 6 

where 

P e = + & + & = & + & + & = & - 6 + 6 

Since 6 is smooth in each variable on the corresponding dyadic scale we can ex- 
pand it in a rapidly convergent Fourier series and separate the variables. Hence 
from here on we replace b by the pointwise bound given in Proposition 15.21 
There are two cases to consider: 

Case 1: Ao <C ^4,5,6- Then for the frequency Ao factor we use Lemma [4.11 
We denote 

(Ai, A 2 , A 3 , A ) = (A, a, /i, /i), A < a < fi 

and 

ho = P a (^a 4 (^4)«a 5 (6)ma 6 (6)) 

A4,5,6S>Ao 

We also recall the bound for 64, namely 

N < a(A)ct~V -1 

Case 1(a) Ao = [i. We consider the three possible terms in f\ . For the 
L l L 2 term we bound u\, u a in L°° and in L^L 2 . We also sum up with 
respect to /i 4s time intervals. This yields 

\A < ^ s \- s a- s [i- s \^a^[T l ' Zs a{\)a- l ii- l \\ut X3 

= X^- s a(\)a- s -^- 2 - 8s \\uf X s 

The summation with respect to A, /i and a is straightforward if s > — \. 

4 

For the L%L\ term in / we bound u\, u a in L°° and in L 4 L°°. This yields 

\I\ < ^ 4s A^a->- s A5a^3 / u-f- 3s a(A)a-V" 1 ||M||^ 

which gives the same outcome as in the previous case. 

For the L 2 part of / at modulation a ^> fi 2 we note that at least one other 
factor must also have modulation at least a. We bound that factor in L 2 and 
the other two in L°° to obtain 



\I\ < I 1 ^ s « V s a^\i^\i 1 3s a(X)a V 1 \\ u \ 



X s 



3 



|6 



= a(A)A s a s 2^ 2 8s ||m|| XS 

which is then summed with respect to A, \i and a provided that s > — |. 

Case 1(b) \ = a <^ fi. This case is simpler; As a consequence of 
Lemma 14.11 and of Bernstein's inequality we have the L 2 type bounds 

(40) M^\\Q< ail h \W + \\Q^h \\ x o,-ii < Ao 1_3s IMIx* 

which is all that we need in the sequel. 
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For the low modulation part of f\ we use the first part of ffjOj) . By orthog- 
onality we can localize the frequency fi factors to a intervals. Then we use the 
bilinear L 2 estimate for u\u^ and the pointwise bound for the other factor. 
This gives 

\I\ < «- 2 >- 2s A->- 2s a5 / i-5 a - 1 -3 s ( ayU )5a(A)a-V~ 1 |hllx- 
= a(A)A- s a- 1 - 5 V 1 - 4s ||M||^ 

The factor a~ 2s fi~ 2s above comes from summation over small time intervals. 
This is better than the earlier /i~ 4s factor because Q< ai if\ is square integrable 
on the better a As time scale. This is summable with respect to A, a and /i if 

s > 

For the L 2 part of f\ at modulation a > a/i we note that at least one other 
factor must also have modulation at least a. We bound that factor in L 2 and 
the other two in L°° to obtain 



\I\ < a~ 2s fi' 2s X' s fi~ 2s aa~ 1 ~ 3s a(X)a' 1 fi' 1 \\u\ 



X s 



i 



which is the same result as above. Note that only an Q2 factor is lost in 
the pointwise bound for due to the additional frequency localization to an 
interval of size a. 

Case 1(c) A = A C a. For the part of f\ with modulation < a\i we 
bound u a Un in L 2 and the other in L°°. This works even if a ~ fi as two 
of the n sized frequencies must be /i separated. We obtain 



\I\ < A- 2 >~ 2s a">" 2s a^"^- 1 - 3s (a / u)2a(A)a-V _] 
= a(A)A~ 5 ^ 1 a~>- 1 - 4s ||?i| 



u\\ xs 



which can be summed up for s > — |. 

If we consider the part of f\ with modulation a ^> a/i then another factor 
must have modulation at least a. We bound that factor in L 2 and the other 
two in L°° as in Case 1(b). 

Case 2: A > min{A 4 , A 5 , A 6 }. Without any restriction in generality we 
assume that 

Ai < A2 < A3, A4 < A5 < A6 

Then we must have 

A4 < Ao < A3 

We can distribute P\ to each factor and also assume that the A5, A6 factors 
have frequency spread at most Ao- 
Denote 

{Ai, A 2 , A 3 , A } = {A, a, A < a < /i 
Case 2a: A = /i. 

Case 2a(i): A 5 = A 6 ^> /i. We use the bilinear L 2 estimate for the products 
u\U\ s and u\ 4 u\ 6 and the L°° bound for u\, u a and add up with respect to 
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A 6 4s time intervals. We obtain 



\I\ < A6 4s A- s a->- 2s A 6 - 2s A5a^- 1 a(A)a-V' i nil^J^ 

< a(\)\*-'a-'-t f i- 2 '-% l - 6s J] \\u Xj \\ x . 

which we sum easily with respect to the parameters Xj subject to the restric- 
tions above. We note that the summation with respect to A5 = Xq requires 
imposes the tight restriction s > — |. 

Case 2a(ii): X$ < Xq = fi and a <C Then we use the pointwise bound 
for u\, the bilinear L 2 estimate for u^u^ and the L 6 Strichartz estimate for the 
remaining two factors; finally, we sum up with respect to /i~ 4s time intervals. 
We obtain 

\I\ < /^ _4s A _s a~> _4s A5/i~5a(A)a _ V _1 |hA||x-||Ma|U=lh M ||x»lhllx- 

< a(X)X^ s a' s ' 1 ^^ 8s \\u x \\ X s\\u a \\xA\ u AxA\ u \\x^ 

The summation with respect to A, a and /i requires s > ^jr. 

Case 2a(iii): A5 < Xq = /i and a — //. Then we use the L 6 Strichartz 
estimate for all the factors to obtain 

|/| < a(A)/i- 4 >- 6s /i- 2 ||M||^ = a(A)/i- 10s - 2 || M ||^ 

Case 2b: A = a <C [i. 

Case 2b(i): X^ = X^~^> fi. Then we use the bilinear L 2 estimate for u^ux^ 
and u^u\ 6 and L°° for u\, u\ 4 and add up with respect to Ag s time intervals. 
We obtain exactly the same bound as in Case 2a(i). 

Case 2b(ii): A 6 < A 7 < fi. Then we are in the same situation as in Case 
2a(ii). 

Case 2c: A = A /i. Then we can argue in the same way as in Case 2b. 

□ 

The final step in the paper is to use Proposition 15.41 in order to conclude the 
proof of Proposition 11.41 We have 

NHh. =^A 2 iu A||i 2 

A 

Then the following result is straightforward: 

Lemma 5.5. There is a sequence {/?a} with the following properties: 
(l) A 2s ||MoA||i 2 < f3\\\uo\\ 2 H s. 

(Hi) P\ is slowly varying in the sense that 

I log 2 (3\ - log 2 Pn\ < -| log 2 A - log 2 fi\ 
The sequence f3\ is easy to produce. One begins with the initial guess 

\ 2s 1 1 _ , 112 
nO _ A IPOAjlx 2 
Px ~ \\v II 2 
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which satisfies (i) and (ii) but might not be slowly varying. To achieve (iii) we 
mollify on the dyadic scale and set 

p x = 2~^H log2 A ~ lo §2 n p° 

The sequence (3\ will play the role of frequency localized energy threshold. 
Precisely, we assume that 

(41) \\uWplool2 < 1. 
and we will show that 

(42) supA^|K (t)|| L2 < pI(\\uo\\h* + \\u\\ 3 X s) 

which by (ii) implies the conclusion of Proposition 11.41 

In order to prove (|42|) for some frequency Ao we define the sequence 

a x = A 2s max{l,/3^ 1 2- e l InA - lllAo l} 

We obtain using the slowly varying condition (iii) 

E a ( A )ii^ni 2 ^ E a2s H m oaII| 2 +2^ |inA - inAo| A 2 ^- i n MoA ni 2 < Km,. 

A A 

Correspondingly we find a function a(£) G S* so that 

o(0 « o(A), |e| « A 

^From (j4ip we obtain sup t E (u(t)) <C 1. Now we use the energy estimates in 
Proposition 15.41 for this choice of a. By Proposition 15.31 the E\ component of 
the energy is controlled by Eq, so we obtain 



^a(A)|Mt)||^ 2 < \\uo\\ H ' 



it: 



A 



which at A = A gives 
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